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ABSTRACT 

We present a catalogue of starless and protostellar clumps associated with infrared dark clouds 
(IRDCs) in a 40° wide region of the inner Galactic Plane {\b\ < 1°). We have extracted the 
far-infrared (FIR) counterparts of 3493 IRDCs with known distance in the Galactic longitude 
range 15° < l < 55° and searched for the young clumps using Hi-GAL, the survey of the 
Galactic Plane carried out with the Herschel satellite. Each clump is identified as a compact 
source detected at 160, 250 and 350 gm. The clumps have been classified as protostellar or 
starless, based on their emission (or lack of emission) at 70/rni. We identify 1723 clumps, 
1056 (61%) of which are protostellar and 667 (39%) starless. These clumps are found within 
764 different IRDCs, 375 (49%) of which are only associated with protostellar clumps, 178 
(23%) only with starless clumps, and 211 (28%) with both categories of clumps. The clumps 
have a median mass of ~250M o and range up to > 10 4 M o in mass and up to 10 5 L o in 
luminosity. 

The mass-radius distribution shows that almost 30% of the starless clumps identified in 
this survey could form high-mass stars, however these massive clumps are confined in only 
4% of the IRDCs. Assuming a minimum mass surface density threshold for the formation of 
high-mass stars, the comparison of the numbers of massive starless clumps and those already 
containing embedded sources suggests an upper limit lifetime for the starless phase of ~ 10 5 
years for clumps with a mass M> 500 M Q . 


1 INTRODUCTION 

The star formation process begins in massive clouds with a reser¬ 
voir of gas and dust sufficient to sustain the creation of a cluster 
of stars (e.g. lLada & L ada 2003), which in some clouds may even¬ 
tually form high-mass stellar objects (e.g., Rathborn e et al.ll2006l : 
IPeretto & Fullen 120091) . Some of these dense, cold clouds which 
are not yet dominated by star formation absorb the IR emission of 
the background radiation and therefore can be observed as dark 
struct ures in the mid-IR images (infrared dark clouds, IRDCs, 
e.g. ISimon et al.l f2006tj . These relatively undisturbed clouds are 
favoured places to study the very early stages of s t ar formation 
i Peretto & Fulled 1 20091 l20ld : iBattersbv et al.l 1201 ll : iRagan et al.l 
120121 ) . 

Within these clouds, the earliest stages are initially charac¬ 
terised by the formation of a starless, dense clump with a size of 
= 1 pc. The cold dust envelope of these clumps emits in the far-IR 
(FIR)/sub-mm re gion of the spectrum, but are still IR-quiet at wave¬ 
length s < 70 /j tn lElia et al.l20 1 (] ; iMotte et al.l20l(jl ; Iciiannini et al.l 
12013 ) . As a protostellar core and then protostar eventually forms 
within the gravitationally bound starless clumps (the pre-stellar 
clumps) it heats the surrounding envelope. The now protostellar 
clump becomes visible at wavelengths < 70 pm and its bolomet- 
ric luminosity increases, sustained by the warm inner core(s). As 


a consequence, the protostellar luminosity correlates well with the 
clump emission observed at 70 pm (iDunham et alj|2008l) . 


Large surveys of the Galactic Plane in the FIR-sub-mm region 
allow us to make a census of these clumps and the young stellar ob¬ 
jects (YSOs) within them across the Galaxy. Such surveys provide 
the significant samples of sources required to understand the star 
formation mechanisms across a wide range of mass and luminosity 
regimes as well as identify rare and/or short-lived classes of ob¬ 
jects. Several surveys have been carried out in the past few years at 
increasing sensitivity and spatial resolution in order to build a sta¬ 
tistically significant sample of young stellar objects in the Galactic 
Plane. The APEX Te lescope La rge Area Survey of the Galaxy (AT- 
LASGAL, I Schuller et alj 20091) and the Bolocam Galactic Plane 
Survey (BGPS, Aguirre et al.ll201 ll) have mapped the inner Galac¬ 
tic Plane in the sub-mm range, at 870 pm and 1.1 mm respectively, 
allowing a census of the sub-mm thermal emission from high-mass 
regions. However, unprecedented opportunities to make a multi- 
wavelen gth FIR/sub m m stud y of the sky arrived with the Herschel 
satellite jpilbratt et al J 120 loh . opening a new window of our un¬ 
derstanding of the cold universe and, in this context, Galactic star 
formation. For example, the earliest phases of star formation asso¬ 
ciated with IRDCs have bee n studied with t he EPoS Herschel key 
program, for both low-mass jLaunhardt et al.120131) and high-mass 
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objects ( iRagan et alj2012l) . This survey studied a sample of 12 low- 
mass and 45 high-mass regions respectively in order to characterise 
the temperature and column densities of the cores and clumps at 
different mass regimes. 

A comprehensive Herschel mapping of the Galactic cold 
and dusty regions has been recently completed with the He rsche l 
infrared Galactic Plane survey (Hi-GAL, Moli nari et al.ll201()hh . 
which has mapped the whole Galactic Plane at latitudes \b\ < 1° and 
following the Galactic warp in the wavelength range 70 < A < 500 
pm. 

The aim of this work is to produce a first extensive cata¬ 
logue of young clumps embedded in IRDCs combining the Hi- 
GAL data_with_themostcomprehensive catalogue of IRDCs to 
date dPeretto & F uller 2009, hereafter Paper I). This initial cata¬ 
logue looks in a specific region of the inner Galactic Plane, in the 
longitude range 15° < / < 55°, which encompasses ^ 3500 IRDCs 
with known distance. 

The paper is structured as: in Section [2] describes the IRDCs 
dataset extracted from Paper I and the Hi-GAL data used to extract 
the FIR/sub-mm IRDCs counterparts; the source extraction and the 
catalogues generation are described in Section[3] In this Section we 
also briefly describe Hyper, a new algorithm used in this work and 
developed for source extraction a nd photom etry in complex back¬ 
grounds and crowded fields dTraficante et al.l2015h . In Section[4]we 
describe the statistical distributions of the starless and protostellar 
clumps properties. Finally, in Section[5]we draw up our conclusions 
about this first release of clumps in IRDCs. 


2 IRDC DATASET 

The IRDCs survey produced in Paper I contains = 11000 IRDCs 
identified in absorption against the warm background in the 
GLIMPSE 8 pm survey of the Galactic Plane l lBeniamin et al.i 
120031 ) delimited by |/| ^ 65°, \b\ < 1°. The IRDCs have been iden¬ 
tified as connected regions with column density higher than Nh 2 
> 1 x 10 22 cnr 2 and a diameter greater than 4" (Paper I). 

We focused on a subsample of 3659 IRDCs observed in the 
region 15° < / < 55°, |Z?| < 1°. This region has been selec ted 
for its overlap with the Galactic Ring Survey (GRS. [jackson et ahl 
120061 ). The GRS emission along the IRDCs line of sight (LOS) has 
been used to identify clouds and to estimate the IRDCs kinematic 
distances. The GRS survey mapped the 13 CO J= 1-0 emission with 
the FCRAO 14m telescope with an angular resoluti on of 46" and a 
spectral resolution of 0.212 km s' 1 djackson et alj2006h . To obtain 
kinematic distances, we developed an automated procedure that ex¬ 
tracts the 13 CO spectrum towards the centre of each IRDC, identi¬ 
fies the number of velocity components in it, integrates each com¬ 
ponent in a 2 km/s interval around the central velocity, and then cal¬ 
culates the ratio map for each of the integrated intensity maps with 
the Herschel column density, smoothed at 46" resolution, centred 
at the IRDC position. We then select the velocity component for 
which the ratio map is the flattest, i.e. with a minimum dispersion. 
In some cases, no 13 CO(1-0) components were found, and a -999 
flag was returned for the velocity. We identified 166 clouds with 
a -999 flag, therefore our starting IRDC catalogue is composed of 
3493 clouds. This procedure is simple, and rather easy to imple¬ 
ment for such a large number of sources . Kinematical distances 
were then calculated using the lReid et alj (1 20091) Galactic rotation 
model, always assuming the clouds located at the near distance in 
case of a distance ambiguity. 


2.1 IRDCs Hi-GAL counterparts 

The Hi-GAL surve y dMolinari et alJ 2010b) has been c arried out us¬ 
ing b oth the PACS dPoglitsch et alj|2010l) and SPIRE ^Griffin et~af 
2010) photometry instruments on-board Herschel JPilbratt et al. 
20101) in parallel mode, observing the sky at five wavelengths si¬ 
multaneously (70 and 160 pm with PACS and 250, 350 and 500 
pm with SPIRE). The maps ha ve been reduced with the ROMA- 
GAL pipeline dTraficante et al.ll201 ll) . an enhanced version of the 
standard Herschel pipeline specifically designed for Hi-GAL. A 
weighted post-processing on the maps has been applied to help with 
image artefact removal dPiazzo et alj|201ll) . The maps have been 
flux calibrated (by means of an offset subtraction) following the 
prescription of iBemard et al J d2010i ) and are expressed in MJy/sr. 
Due to the fast scan-speed and the co-addition on-board Herschel 
of 8 samples at 70 pm and 4 samples at 160 pm, the measured PACS 
beams in the maps are slightly larger than the nominal ones. The 
Hi-GAL beams measured on the maps are 10.2", 13.55", 18.0", 
24.0" and 34.5" at 70, 160, 250, 350 and 500 pm respectively. The 
map pixel size is 3.2", 4. 5", 6.0", 8.0", 11.5" a t 70, 160, 250, 350 
and 500 pm respectively dTraficante et all201 ll) . 

For each IRDCs in our sample we selected the correspond¬ 
ing region in the Hi-GAL 70, 160, 250 and 350 pm maps. We do 
not include the 500 pm maps in the analysis due to the poor beam 
resolution compared to the other wavelengths. 

Some of the IRDCs of ou r sample ha ve been already identi¬ 
fied in the Hi-GAL survey by lPeretto et alj d2010l) . In this work the 
authors showed that the IRDC FIR extension is slightly bigger than 
their appearance at 8 and 24 pm. Therefore, for each IRDC, we 
isolate an Hi-GAL region which enlarged the 24 pm IRDC size by 
the equivalent of a 70 pm beam (10.2") in each spatial direction. 
We assume that all the sources extracted between these extended 
boundaries belong at the same IRDC. Due to the boundaries exten¬ 
sion and the proximity of some IRDCs, some sources can be asso¬ 
ciated with two different clouds. In these cases we associate each 
source with one IRDC randomly selected from the two clouds. 


3 CLUMP EXTRACTION AND PHOTOMETRY 

Source extraction and photometry is a well known problem in as¬ 
tronomy, in particular in complex fields such as the Galactic Plane. 
The high resolution and sensitivity of the new IR instruments have 
shown the complexity of the sites in which the newly born stars 
are Jocated (such as the filamentary structures, e.g. iMolinari et al .1 
!2010al) . In particular the background variability can be significant 
in FIR Galactic Plane data, especially at l onger wavelengths (e.g., 
IMolinari et alj l2010al ; IPeretto et alj 120 ldh . and models must ac¬ 
count for its high variations across each source. Furthermore the 
Galactic Plane is dense and crowded and it is often the case that 
sources are partially blended together. The problem is further com¬ 
plicated in case of multi-wavelengths study since each band has a 
different spatial resolution and often single sources are resolved in 
multiple objects in the high-resolution maps. 

Various approaches based on di fferent techniques (e.g. Cu - 
tex, Molinari et al.l d201 ll) : getsources, Men’shchikov et al] d2012l) ) 
have been developed specifically for the new Herschel FIR data, 
in particular for the Galactic surveys. Cutex in particular is the 
standard source extraction algorithm used by the Hi-GAL team. 
It identifies the compact source in the second derivative image of 
the sky, and fits the sources with a 2d-Gaussian model. In this ap¬ 
proach the physical source diameter, FWHM rffc , also needed to con¬ 
straint the source flux, is evaluated through the deconvolution of 
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the HPBW with the FWHM A of the source m easured at each band 
and it is therefore wavelength-depende nt fe.g. lMolinari et al. 1 20 111 : 
lElia et al 1 l20ld : IVeneziani et al.ll2013h . 

We decide to adopt a different approach and to estimate the 
flux consistently within the same volume of gas and dust at all 
wavelengths regardless of the different spatial resolutions. A simi¬ 
lar approach have been adopted bv lOlmi et al.l J201 3h to study the 
C1MF in Hi-GAL fields. 

For this purpose we used a new algorithm, Hyper (HY- 
brid Photometr y and Extraction Routine), fully described in 
iTraficante et al.l J2015I) . It has been designed specifically to take 
into account the complexities generated by the new datasets such 
as the Galactic Plane observations made with Herschel, in particu¬ 
lar the high background variability and the source crowding. Hyper 
is based on a hybrid approach between the classical aperture pho¬ 
tometry and a 2d-Gaussian modelling of the sources. In the case 
of multi-wavelength analysis, the region of integration is defined 
at a particular wavelength and the same region of the sky is used 
to integrate the flux at all wavelengths. In the following Sections, 
along with the main results of this work we will describe the Hyper 
parameters tuned for this source extraction and photometry. 

3.1 The source catalogues 

The sources are initially identified at each wavelength separately on 
a modified high-pass filtered map usin g in sequence the find and 
gcnrtd IDL routines ( ITraficante et al. 120150 . These routines fit a 
Gaussian model to recognize the peaks above a given threshold cr, 
defined as a multiple of the r.m.s. of the filtered map, 07. A ref¬ 
erence threshold value for each Herschel wavelength, which min¬ 
imizes the false positi ves and maximizes the identification of real 
sources is provided in lTraficante et alj J2OI5I1 . However the source 
recovery for a given value of cr, depends on variations of the (local) 
background, the source crowding and the cloud size, which differ 
from cloud to cloud (see also the discussion in Section l4.1.1l l. We 
tested several valu es of cr, around the reference values discussed 
in lTraficante et alj i2015t ) and after visual inspections of randomly 
selected IRDCs we set a - f =[6.0,5.0,4.0,4.0]'O"/ at [70,160,250,350] 
pm respectively. These thresholds are a conservative compromise 
between source recovery and false identifications. From our visual 
inspections, less than 1% of the sources appear as false positives. 
The completeness of our catalogue based on this extraction is dis¬ 
cussed in Section l4.1.11 

From this extraction we produced a catalogue of sources in¬ 
dependently identified at each wavelength. Hyper identified 8220, 
5393, 4967, 3413 sources at 70, 160, 250 and 350 pm respectively. 
They are associated with 2070, 1640, 1621, 1246 IRDCs, respec¬ 
tively = 59%, - 47%, =: 46% and = 36% of the whole sample. The 
high number of 70 pm sources compared to the other wavelengths 
is likely due to the high spatial resolution of the 70 pm data, which 
allows the resolution of close sources possibly unresolved at longer 
wavelengths. Conversely, the low spatial resolution of the 350 pm 
band blends sources resolved at shorter wavelengths. The longitude 
distribution of sources at 70, 160 and 350 pm is in Figure [I] The 
250 pm source distribution is very similar to that of the 160yum 
sources and is not showed. 

The multi-wavelength catalogues contain the photometry of 
the sources observed at all the specified wavelengths, as discussed 
in the next Sections, starting from a reference wavelength used to 
identified the candidates. The reference wavelength is 160 pm, the 
highest resolution wavelength that is in common among protostel¬ 
lar and starless clumps. We initially generated a merged catalogue 



Figure 1. Longitude distribution of sources independently identified at 70 
(brown), 160 (green) and 350 (blue) pm. The distribution of 250 p m sources 
is very similar to the 160 pm distribution and it is therefore not shown. 

of 160, 250 and 350 pm sources to produce a list of starless and 
protostellar clump candidates in our selection of IRDCs. At the lo¬ 
cation of each 160 pm source, a 250 and/or 350 pm counterpart is 
associated if a source is identified within a radius equal to half the 
160 pm beam, 6.7". The merged catalogue contains 1723 clumps 
associated with 764 different IRDCs (=: 22%). 

A fraction of IRDCs identified in Paper I may not have a coun¬ 
terpart in the Hersc hel wavelengths and may not be real clouds 
(e.g. IWilcock et alj|20f3 ), however the small fraction of IRDCs 
with identified Herschel compact sources here is due to a com¬ 
bination of factors. First, the merged catalogue does not consider 
sources detected at only one or two wavelengths, nor objects de¬ 
tected at 160, 250 and 350 pm but with centroids separated by more 
than 6.7" from the 160 pm centroid. Second, most IRDCs smaller 
than = 20" are too small to be associated with a compact Herschel 
sources. Note that in particular with this choice we are not includ¬ 
ing in the catalogues the sources observed at 250 and 350 pm with 
no counterparts at 160 pm. We identified ^ 400 sources observed 
at 250 and 350 pm only. Some of these sources are potentially ex¬ 
tremely young, their envelope so cold and/or diffuse that they do 
not emit above the background at wavelengths A 160 pm. How¬ 
ever, we cannot extract physical parameters for sources identified 
at only two wavelengths, therefore we restrict the analysis to the 
clumps already visible at 160 pm. A multi-wavelength analysis of 
these very cold clumps is the subject of a subsequent work. 

An independent extraction run used to produce a list of 70 pm 
sources has been combined with the previous sample to produce 
two final catalogues: 

1. Sources identified at 160, 250 and 350 pm but without coun¬ 
terparts at 70 pm. These sources are classified as starless clumps. 
The catalogue contains 667 clumps associated with 389 IRDCs. 
The IRDCs and source longitude distributions are shown in Fig- 
ures[2]and[3]respectively. 

2. Sources identified at 160 , 250 and 350 pm with (at least) 
one counterpart at 70 pm. A 70 pm source is associated with each 
clump if its distance from the centroid of the 160yum source is less 
or equal to half the 160 pm beam, 6.7". These sources are classi¬ 
fied as protostellar clumps. This catalogue contains 1056 sources 
associated with 586 IRDCs. The IRDC and source longitude distri¬ 
butions are shown in[4]and[5]respectively. 
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Figure 2. Longitude distribution in the range 15° < / < 55° of IRDCs with 
at least one starless clump (389, blue) together with the distribution of the 
IRDCs with at least one 160 pm detection (1640, green) and the distribution 
of the total number of IRDCs in our catalogue with known distances (3493, 
brown). 
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Figure 3. Longitude distribution of sources. In green the 160pm sources 
(5393) and in azure the starless clumps (667). 


In total 211 clouds 6% of the total) contain both starless 
and protostellar clumps. 

Figures 1-5 show that the majority of the sources are located 
in the longitude range 25° < / < 30°, and a second peak in the 
source distribution occurs around / = 50°, for both starless and 
protostellar clumps. The longitude distribution of the clumps as a 
function of their distance is shown in Figure[6] The clump distances 
are assumed equal to the distance of their parent IRDCs. The me¬ 
dian distances are d = 4.2 kpc for both the starless and protostellar 
clumps. 

The distribution of clumps is sparse across the Galaxy with 
two LOS which contain the majority of the clumps and of the 160 
pm sources. The region around 25° < l < 30° corresponds to 
sources mostly located at a distance d ^ 5 kpc, which is where 
the LOS p asses the tangent point across the Scutum-Crux arm (fol¬ 
lowing the lRusseU 120031) spiral Galaxy model). Being close to the 
tangent point, the distance ambiguity is minimal for these sources 
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Figure 4. IRDCs longitude distribution of IRDCs with at least one proto¬ 
stellar clump (586, red). The other bars are as shown in Figure[2] 
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Figure 5. Longitude distribution of protostellar clumps in light red (1056). 
The other bar is as shown in Figure[3] 


and there are likely to be associated with the Scutum-Crux arm. 
The source distribution along this LOS is not dominated by starless 
or protostellar clumps. The peak around / = 50° corresponds to the 
tangent point of the Sagittarius-Carina arm, but passes also through 
the Perseus and the Norma-Cygnus arms. Some of the clumps in 
this region come from the furthest IRDCs present in our catalogue 
(see Figure |6j. located at ~ 10 kpc and are likely to be associated 
with the Perseus arm. The peak at / = 50° in the source distribu¬ 
tion is more evident in the starless clumps. Although some 70 pm 
sources could be too faint to be observed in the furthest clouds, 
the mean IRDCs distance around / = 50° is similar to the mean 
distance of the clouds around I = 30°, 5 kpc and 4.6 kpc respec¬ 
tively. Therefore, the peak at / = 50° in the starless distribution is 
likely to be a sign of younger star-forming region along this LOS 
compared to the Scutum-Crux region. The distribution in Figure [6] 
indicates that the Hi-GAL sources are mainly located on the Galac¬ 
tic arms, although j>ome are in interarms regions, as already noted 
bv lRusseil et al.l d201 ill . 
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Figure 6. Starless and protostellar clump longitude distribution as function 
of their parent IRDC distance. The two distributions look similar and there 
are no regions with a clear predominance of starless or protostellar clumps. 
The majority of the sources are located around 25° < l < 30° and / = 50°. 
The median distances of the two distributions are d = 4.21 kpc and d = 4.23 
kpc for starless and protostellar clumps respectively. 


+ Storless 
+ Protostors 





3.2 Photometry 

The flux of each source is estimated by integrating the emission 
within the same elliptical region at all wavelengths. The semi-axes 
a and b of the elliptical aperture are equal to the FWHM C of the 
2d-Gaussian fit to each source estimated at a reference wavelength, 
which can b e different from the wa velength used to initially identify 
the clumps ITraficante et all2015h . For the analysis here we set the 
reference wavelength to A = 250 /jm. The FWHM G of each fit can 
vary from a minimum of 1 • FWHM.i (18" at A = 250 pm) up to 
2 • FWFIMj (36"). This size at A = 250 /im encompasses a region 
of at least 1.5 ■ FWHM, at A = 350 p m within the integration area. 
The range of allowed FWHMc has been chosen in order to account 
for point-like and slightly elongated compact sources and, at the 
same time, to avoid highly elongated fits likely contaminated by the 
underlying filaments aligned with some of the sources. The profile 
at /l = 250 p m also defines the source size as described in Section 

14H 

We note that within each elliptical aperture the clump could 
be resolved in multiple objects at 70 and 160/mi. In these cases, 
the closest source in the cluster is identified as the counterpart and 
we assign each clump the 70 and 160,/mi flux arising from all the 
sources within the elliptical integration area. This choice is con¬ 
sistent with our approach of evaluating the flux within the same 
area, since the sources are blended within the beam at longer wave¬ 
lengths and they all contribute to the observed emission. In the 
case of these resolved clusters, a specific keyword in the catalogue 
(“cluster”) indicates the number of multiple sources observed at 70 
and 160/im within each integration area. 

The source flux evaluation is done at each wavelength after the 
background emission is subtracted and, in case of blended sources, 
the flux arising from the companions is removed. Hyper evaluates 
the local background by selecting various square regions across 
each source and modelling the emission in each region with polyno¬ 
mial functions of different orders (from the zeroth up to the fourth). 
The fit which produces the lowest r.m.s. of the residuals is assumed 
as best fit and subtracted as background contribution. 

Two (or more) sources are considered blended if their cen¬ 


troids are closer than a fixed distance. This distance is automati¬ 
cally evaluated by the algorithm and it is equal to twice the maxi¬ 
mum allowed FWHM G (e.g. 72"), the maximum distance at which 
two elliptical apertures can be partially overlapped. Each source 
and its blended companions are fitted simultaneously with a rnulti- 
Gaussian function using the mpfit IDL routine dMarkwardtll2009tl . 
The parameters of the fit are used to build 2d-Gaussian mod¬ 
els of the companions which are then subtracted from the image. 
The flux of the companion-subtracted source is then evaluated as 
in the isolated source case. A detailed description of the back¬ 
ground subtraction and source de-blending strategies can be found 
in ITraficante et al.l 120151) . However the source fluxes still require 
to be corrected for two factors: the aperture and colour corrections. 


3.2.1 Aperture corrections 

Aperture corrections A c are needed in aperture photometry to com¬ 
pensate for the flux distributed outside the integration area due to 
the convolution of the true sky signal with the instrumental beam, 
the PSF response. These corrections therefore depend on the beam 
at each different wavelength and, for point-like sources, can be esti¬ 
mated by measuring the source emission of reference sources with 
known flux. A further correction factor has to be applied for ex¬ 
tended sources. 

For PACS, the aperture corrections are known for circular 
apertures as function of aperture radiufl The equivalent aperture 
radius r eq of each source, described in Section |4~T1 is used to esti¬ 
mate the corresponding A c . 

For SPIRE, aperture corrections are known for circular aper¬ 
tures as function of aperture radius and source spectral inde)£]. The 
corrections are estimated at fixed aperture radii of 22", 30" and 40" 
at 250, 350 and 500 pm respectively. Our aperture radius depends 
the 2d-Gaussian fit at A = 250 pm to each source and it is fixed 
at each wavelength (Section 13.21 . However, the range of allowed 
FWHM G is from 18" to 36", therefore we decided to adopt the 
proposed corrections without further assumptions and assuming a 
spectral index of f3 = 2.0 (as discussed in Section[4}. 


3.2.2 Colour corrections 

Colour corrections are needed to convert the flux density measured 
by PACS and SPIRE into the monochromatic flux density for each 
observed object, which depends on the intrinsic temperature and 
spectral energy distribution (SED) of the source. The colour correc¬ 
tions are available online for both PACS0and SPIRF0 instruments. 
The colour corrections have been measured for different spectral in¬ 
dexes and blackbody (and greybody) temperatures for both PACS 
and SPIRE instruments. We adopted a fixed spectral index /3 = 2.0 
(see Section 0. The PACS colour correction curves extrapolated 
for temperatures in the range 10 < T < 30 K for both the PACS 
70 and 160 filters are shown in Figure|7] In order to extrapolate 
the colour corrections for our sample however we need to know the 


1 http : //herschel.esac.esa.int/twiki/pub/Public/PacsCalibrationWeb/ 
pacs_bolo_fluxcal_report_v 1 .pdf 

2 http : //herschel.esac.esa.int/twiki/pub/Public/PacsCalibrationWeb/ 
http : //herschel.esac.esa.int/Docs/SPIRE/spire_handbook.pdf 

3 http : //herschel.esac.esa.int/twiki/pub/Public/ 
PacsCalibrationWeb/cc_report_v 1 .pdf 

4 http : //herschel.esac.esa.int/twiki/pub/Public/PacsCalibrationWeb/ 
http : //herschel.esac.esa.int/Docs/SPIRE/spire_handbook.pdf 
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Figure 7. PACS colour correction curves for both the 70 and 160 pm filters, 
extrapolated from the colour correction values at T=10, 15, 20 and 30 K de¬ 
scribed in the PICC-ME-TN-038 PACS report. The green and black crosses 
show the colour corrections applied for the starless and protostellar clumps 
flux distributions respectively. Their values are 1.307 and 0.946 for starless 
and 1.223 and 0.955 for protostellar clumps at 70 and 160 /im respectively, 
corresponding to a temperature of T=15.52 K and 17.17 K for starless and 
protostellar clumps respectively. 

grey body temperature in advance and this poses a circular prob¬ 
lem IPezzuto et alJfioij ). So we estimate the colour corrections 
following an iterative procedure. We first fixed the colour correc¬ 
tions assuming T=15 K for both starless and protostellar clumps, 
in agreem ent w ith previous estimations of their envelope tempera¬ 
ture (e.g. IVeneziani et ail 1 201 3ll . We obtained a first estimation of 
the greybody temperatures of our sources resulting in median tem¬ 
perature values of T=14.37 K and T=15.84 K for starless and pro¬ 
tostellar clumps respectively. These new values were then used to 
estimate the first iteration of the colour corrections from the curve 
in Figure[7] The SEDs obtained with these new corrections give me¬ 
dian temperature values of T=15.52 K and 17.17 K for starless and 
protostellar clumps respectively. Adopting the colour corrections 
for these temperatures resulted in median temperatures of T= 15.49 
K and T=17.13 K for starless and protostellar clumps respectively, 
sufficiently close to the previous estimate, that no further iterate was 
carried out. The adopted colour corrections are 1.307 and 0.946 for 
starless and 1.223 and 0.955 for protostellar clumps at 70 and 160 
pm respectively. The SPIRE colour corrections do not vary signif¬ 
icantly in the range 10-20 K at p = 2.0. Therefore, we adopted the 
publicly available values assuming T=15 K for both starless and 
protostellar clumps. The values are 1.223 and 0.955 for the 250 
and 350 pm fluxes respectively. 


3.2.3 Flux error 

The quoted error for the flux in the PACS and SPIRE reference 
manuals is, after aperture and colour corrections, < 5%. This er¬ 
ror has to be combined with the error associated with the Hyper 
photometry. The error quoted in the catalogues is the flux error es¬ 
timated as the local sky r.m.s. per pixel multiplied for the num¬ 
ber of p ixels in the area over which the source flux is integrated 
dTraficante et al.ll2015l) . In addition, the precision in recovering the 
source flux depends on the intensity of each source and the local 
background variability. For each source we assume a conservative 


error of 5% of its flux associated with the aperture and colour cor¬ 
rections plus an error of 15% from Hyper measurements. Hence 
each source flux has associated a conservative error of 20% of its 
flux at every wavelength. 

3.3 The clumps catalogues 

An example of source extraction and photometry at the four 
considered wavelengths in the SDC23.271-0.263, catalogued as 
HGL23.271-0.263, is shown in Figure [8] Hyper identified seven 
sources observed at 160, 250 and 350 pm simultaneously. Six 
sources have a 70 pm counterpart and are therefore classified as 
protostellar clumps, one is classified as starless clump. The 2d- 
Gaussian fits of the clumps evaluated at 250 pm are shown in 
green and blue for starless and protostellar clumps respectively. The 
sources identified at 160 pm are indicated as black crosses in the 
160 pm map. 

The catalogue parameters obtained for the sources extracted in 
HGL23.271-0.263 are listed in TableQ] The quantities reported for 
each source are: the IRDC name and the source identification num¬ 
ber as extracted from Hyper (Columns 1-2), which corresponds to 
the number in the associated region file; the wavelength (Column 
3); the source peak flux in MJy/sr and Jy, evaluated at the centroid 
pixel (Columns 4-5); the source integrated flux (not corrected) and 
its associated error (Columns 6-7); the sky r.m.s. before and after 
the background subtraction (Columns 8-9); the polynomial order 
used to estimate the background (Column 10); the source semi¬ 
axes and the position angles as obtained from 2d-Gaussian fitting 
at the reference wavelength, 250 pm (Columns 11-13); the conver¬ 
gence status of the fit (Column 14). This status is 0 if the fitting 
routine converges, otherwise it assumes the value -1 if the source 
fit is too elongated (axes ratio > 2.5) or -2 if the fitting routine 
did not converge. In both these cases the code forces the semi-axes 
to be equal to the average of the minimum and maximum FWHM 
possible values (i.e., 1.5 • FWHMasiy™); the position of the source 
centroids at each different wavelength, in both Galactic and Equato¬ 
rial (J2000) coordinates (Columns 15-18); the distance between the 
160 pm centroid and the centroids of the source counterparts at the 
other wavelengths (Column 19); the number of sources with their 
2d-Gaussian profile overlapped and de-blended before the source 
flux integration (Column 20). The number of multiple sources re¬ 
solved at wavelengths other than the 250 pm within the integration 
area (Column 21). 
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Figure 8 . The FIR counterpart of the IRDC catalogued as HGL23.271-0.263. From top to the bottom: 70, 160, 250 and 350 //m images. The protostellar 
clumps are shown in blue and the starless clumps are shown in green. The ellipses correspond to the elliptical region, defined at 250 /im, used to estimate the 
source fluxes at all wavelengths. The black crosses in the 160 fim image shows the sources initially identified in this map. 
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Table 1. Output parameters for protostellar and starless clumps extracted in HGL23.271-0.263. The catalogues for protostellar (sources 1-6) and starless clumps (source 7) are separated by a blank line in this table. 
Column 1: IRDC name. Column 2: Hyper source number. Column 3: wavelength. Columns 4-5: source peak flux in MJy/sr and Jy. Columns 6-7: source integrated flux and source flux error in Jy. This source flux 
is not corrected for aperture or colour corrections. The flux error is estimated from the local sky r.m.s. multiplied by the number of pixels in the area over which the source flux is integrated. Column 8-9: r.m.s. of 
the sky evaluated in the rectangular region used to model the background before and after the background subtraction respectively. Column 10: polynomial order used to model the background. Columns 11-12: 
minor and maximum FWHM of the 2d-Gaussian fit. Column 13: Source Position Angle. Column 14: goodness of the 2d-Gaussian fit. Status is equal to 0 if the fit has regularly converged. Columns 15-18: source 
centroids Galactic and Equatorial coordinates (J2000). Column 19: distance from the source centroids in the reference wavelength and the source counterparts at the other wavelengths. Column 20: number of sources 
identified as companions and de-blended. Column 21: number of source counterparts at each wavelength. It normally is equal to 1 but it can be higher if the source is resolved in more than one counterpart in the 
high resolution maps. 
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4 STARLESS AND PROTOSTELLAR CLUMPS 
ANALYSIS 


The source photometric properties and the source distances are 
combined to build the SEDs and estimate equivalent radius, tem¬ 
perature, mass and FIR luminosity. We assume a single temper¬ 
ature greybody model with a fixed spectral index to describe the 
emission of the cold dust envelope of both starless and protostel¬ 
lar cl umps. We model the flux of each sou rce S y at frequency v as 
(e.g.. lElia et al.ll2oIol : lGiaiinini et all2012h : 


Sy 


Mkq ( V 
d 2 (vo 


B V (T)Q. 


0 ) 


constraining the SED with only three points. The majority of the 
sources have good fits however, and the final catalogue contains 
649 out of a total of 667 starless clumps and 1030 out of 1056 pro¬ 
tostellar clumps. 

Figure [9] shows the SEDs for the starless and protostellar 
clumps observed in HGL23.271-0.263. The plots for the protostel¬ 
lar clumps also show the emission at 70 /im, which is in excess of 
the SED fits in all cases. Each SED plot reports the best-fit parame¬ 
ters, the distance of the source and the value of the^ 2 . These values 
are also reported in Table[4] 

4.1 Radius, temperature and mass distributions 


where v 0 and kq are respectively a normalisation frequency and the 
dust mass absorption coefficient evaluated at v 0 (vo =23 0 GHz and 
ko= 0.005 g cm 2 with a gas to dust ratio of 100, iPreibisch et alJ 
11993h . The spectral index /3 has been fixed to /3 = 2. 0, in agree- 
ment with the standard value for cold dust emission eHildebrandl 
1 1983h and with previous studies of starless and protostellar clump s 

(e.g. lMotte et alj20ld : lK6nvves et alj20ld : lveneziani et al.l2013l) . 

The total mass (gas+dust) is M and £2 is the solid angle in which 
the flux has been evaluated. B V (T) is the Planck blackbody function 
at frequency v and temperature T. Finally, d is the distance of the 
source. 

For both starless and protostellar clumps we used the fluxes 
at 160, 250 and 350 pm to model the SED and estimate mass and 
dust temperature. The FIR luminosity is evaluated by integrating 
the flux in the range 70 < A < 500 pm. The 500 pm is extrapolated 
from the SED fit in both the distributions, where the 70 pm flux is 
extrapolated from the fit for starless clumps and is obtained from 
the measured flux for protostellar clumps. The 70 pm flux (and that 
at shorter wavelengths) of the protostellar clumps i s par tly influ- 
enced by the warm core(s) in the centre (e.g jDunham et al]|2008l : 
iMotte et al.l I2010 Il A single temperature greybody model cannot 
account for this second, warm component. Therefore, we do not 
include the 70 pm emission to estimate the protostellar clump dust 
temperature and mass. 

We note that the temperature gradie nts ac ross the whole 
IRDCs can be up to = 10K IPeretto et alii20 id) , but it is neg¬ 
ligible across the starless clumps. Furthermore, the differences in 
the starless clumps mass estimation assuming a single-temperature 
greybody modeler with_a detailed radiative transfer model is negli¬ 
gible dWilcock et al.l201ll) . At the same time, the central regions of 
the protostellar clumps partially warmed-up by the central core(s) 
are not included in the model, therefore the model underestimates 
the protostellar clumps central temperature. As a consequence of 
Equation Q] the model could overestimate the protostellar masses 
(see in Section |4 ~TT i. The adopted single clump-averaged tempera¬ 
ture model is however a reasonable approximation to describe the 
emission arising from the cold, extended dust envelope of both 
types of objects. A more complete analysis of protostellar prop¬ 
erties requires to model at least two different tem peratur e compo¬ 
nents, using more sophisticated models (e.g. lRobitaille et alj2007ll 
and using shorte r wavelengths to constraint the warm component 
(e.g. lMotte et alfeoiol) . 

The best fit for the greybody model is evaluated with a 
chi-s quare minimisation analysis using the mpfit IDL routine 
dMarkwardtll2009h . In order to restrict the analysis to the sources 
with good temperature, mass and luminosity estimates, only 
sources with fits with f 1 < 10 are considered further. To further 
delimit the sample, we excluded few sources with^ 2 <10 but with 
unrealistic temperature values (T> 40 K), likely a consequence of 


We adopt a source size given by the equivalent radius r eq , defined 
as the radius of a circle with the same area, A = nr 2 q , as the ellip¬ 
tical region evaluated from the 250 /rm source fit. Unlike previous 
work on Herschel data we do not assume the deconvolved radius 
to describe the source size, which is a description of the intrinsic 
shape of the source but it is wavelength-depe ndent (e.g. lMotte et al.l 
l20ld : iGiannini et~al]|2012l : lElia et al.ll2013h . Instead, r eq = fAfn 
describes the dimension of the region across which the flux is eval¬ 
uated at all wavelengths. 

The distribution of r eq for both starless and protostellar clumps 
is shown in FigureflOl A Kolmogorov-Smirnov (KS) test shows that 
there is no evidence that the distributions are different. The median 
values of the two distributions are similar, r eq = 0.6 pc. At the 
median distance of our sample of IRDCs (4.2 kpc, see Section[3j} 
the SPIRE 250 pm beam is equivalent to = 0.4 pc, therefore in most 
cases we cannot resolve single cores (r <0.1 pc. lBergin & Tafallal 
l2007t) . The apparent size threshold around R= 1 pc is because the 
vast majority of the sources lie below = 5.5 kpc and we set the 
maximum aperture radius, which defines the equivalent radius R eq 
(see Section l4~0 . to twice the FWHMiso/mi, 36", which corresponds 
to R eq = 1 pc at d=5.5 kpc. This choice of restricting the fit to twice 
the FWHM 250 fim is made to isolate compact structures embedded 
in the clouds and we expect the clumps to be relatively circular, 
with aspect ratios of less than 2 on average (e.g. lUrquhart et al.1 
|2014|) . The few sources with radii r eq > 1 pc are associated with 
the furthest IRDCs of our catalogue, located around d = 10 kpc in 
correspondence of the Perseus arm at / = 50° (see Section [370 . 

The distribution of source temperature for both the starless and 
protostellar clumps are shown in Figure [IT] The median tempera¬ 
tures of the distributions is T=15.5 K and T=17.1 K for starless and 
protostellar clumps respectively. These temperature distributions 
are in agreement with similar analysis of starless and prot ostars 
done with Hi-GAL both in the inne r Galaxy (e.g. IVeneziani et al.1 
1 201 3l) and in the outer Galaxy (e.g. lElia et al.|[2QI3 ). The KS test 
shows that the probability that the two distributions are the same 
is < 1CT 9 , although it is clear that the protostellar clumps are only 
slightly warmer than starless clumps on average. Since the emis¬ 
sion in the 160-350 yum spectral range arises from the dust enve¬ 
lope for both starless and protostellar clumps, this result implies 
that the protostellar envelope is only partially warmed-up by the 
central core(s), which may be an indication of the youth of the pro¬ 
tostellar systems. 

The mass distributions of the sources are shown in Figure 
ED The red vertical line defines M ram =105 M 0 which indicates 
the mass completeness limit as explained in the next Section. This 
limit lies below the turnover point which is around M= 10 3 M 0 . 
The identified sources span a wide range of masses, from few solar 
masses up to few = 10 4 M 0 with average clump mass of -193 Mg 
and =272 M 0 for starless and protostellar clumps respectively. The 
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Figure 9. Fluxes at 70, 160, 250 and 350 //m and SEDs for the 6 protostellar (blue) clumps and the starless (bottom, red) clumps in HGL23.271-0.263. The 
spectral index is fixed to (3 = 2.0. The free parameters of the fit are the temperature and the mass, while the luminosity is obtained integrating the emission in 
the range 70 < A < 500 fim. The^ 2 value of the fit is also shown. 
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IRDC 

source 

type 

R 

(pc) 

M 

(M g ) 

L 

(Lq) 

T 

(K) 

Terr 

(K) 

x 2 

distance 

(kpc) 

23.271-0.263 

1 

protostellar 

0.71 

228 

926 

17.7 

2.1 

0.595 

5.21 

23.271-0.263 

2 

protostellar 

0.76 

1240 

3110 

16.0 

1.6 

0.055 

5.21 

23.271-0.263 

3 

protostellar 

0.73 

152 

1748 

22.7 

3.1 

1.730 

5.21 

23.271-0.263 

4 

protostellar 

0.91 

2537 

8591 

17.7 

1.9 

0.042 

5.21 

23.271-0.263 

5 

protostellar 

0.91 

1628 

6175 

17.4 

1.9 

0.326 

5.21 

23.271-0.263 

6 

protostellar 

0.91 

563 

1073 

15.8 

1.5 

0.129 

5.21 

23.271-0.263 

7 

starless 

0.56 

1148 

200 

11.7 

0.7 

0.777 

5.21 


Table 2. Temperature, mass and luminosity of the 7 clumps identified in HGL23.271-0.263, six protostellar and one starless. Columns 1-2: IRDC name and 
source number. Column 3: protostellar or starless type. Columns 4-6: radius, mass and luminosity of each source. Columns 7-8: temperature and the associated 
error estimated with the mpfit routine. Column 9: x 2 value of the greybody fit. Column 10: source distance, equal to the parent IRDC distance. 



Equivalent radius (pc) 


Figure 10. Equivalent radius distribution of the clumps identified in the cat¬ 
alogue for both starless (red) and protostellar (blue) clumps. The mean val¬ 
ues are 0.61 pc and 0.63 pc for starless and protostellar clumps respectively. 
The radii have been evaluated as the radii of circles with the same area of 
the ellipses estimated from the 2d-Gaussian fits at 250 /./m, as explained in 
detail in the text. 


distributions are similar for both starless and protostellar dumps, 
likely due to the fact that the envelope of these clumps are not sub¬ 
stantially perturbed by the internal gravit ational collaps e which is 
occurring in the protostellar cores flGiannini et al.ll2012r) . Some of 
these clumps are potentially the birth site of high-mass objects, as 
discussed in Sections [4.2l and l4~3l 

Some of the clumps in the sample have also been observed 
as part of the Herschel EPOS survey of high-mass star forming 
IRDCs bv lRagan et alj d2012l) . They found 496 protostellar cores 
and clumps simultaneously observed at 70, 100 and 160 p m associ¬ 
ated with 45 IRDCs across the Galactic Plane. Longer wavelengths 
were excluded from the analysis in ord er to preserve the high spatial 
resolution of the PACS data. In the lRagan et al.1120121) catalogue 64 
protostellar clumps fall in the longitude range 15° < 1 < 55° and 
overlap with our sample. All these 64 sources were detected in our 
analysis at 70/rm, but only 35 of them have observable counterparts 
at 160, 250 and 350 p m and are part of the protostellar clump cat¬ 
alogue. In addition 3 sources have been identified as starless, since 
the 160 pm centroids are located more than 6.7" away from the 70 
/jm source (Section ^. 11 . 


The properties of the two overlapping samples are not straight- 
forw ard to compa re, since the higher spatial resolution of the 
iRagan et al.1 d2012h sample allows them to model the properties of 
the inner part of the clumps, and in some cases of the single cores. 
Indeed in our catalogue the dust temperature are, on average, 5 K 
colder and th e mass are, jm average, a factor of ^ 8 higher com¬ 
pared to the Rag an et alJd 2012 |) catalogue. The mean mass for the 3 
clumps we identify as starless is M^2900 M 0 whereas Ragan et al. 
d2012l) find a mean of 740 M G . Similarly for the the 35 pro¬ 
tostellar clumps, in our catalogue the mean mass is M^1350 M 0 
whereas [Ragan et al] [20121 find M^215 M G . The in clusion of the 
70 p m flux in the SED fitting bv lRagan et al J d 20 1 2 |) significantly 
raises the temperature estimation which contributes to these mass 
differences. The sources have a mean temperature of 14.4 and 18.3 
K for starless and 17.5 and 2L7 K for protostellar clumps, in our 
catalogue and in the lRagan et al.1 [20121 ) catalogue respectively. 

It is important to note that the mass (and mass surface den¬ 
sity) values have significant uncertainties due to the assumed dust 
model. The opacity and the spectral index of the d ust can vary 
among the different models by a factor of 2 or more iOrmel et al .1 
1201 ll) . High-resolution, multi-wavelength observations at sub- 
mm/mm wavelengths are required to better constraint the dust prop¬ 
erties and hence the mass of each clump. 

4.1.1 Mass completeness 

The mass completeness of the catalogue is determined by the abil¬ 
ity to recover faint sources in the presence of highly variable back¬ 
grounds, which vary from cloud to cloud by orders of magnitude. 
Also, we are extracting sources from thousands of relatively small 
clouds located at a range of distances, d, which will have masses 
oc d 2 (see EquationQ}. Therefore a single completeness limit for the 
catalogue is poorly representative of the whole sample of clouds. 
Instead, a mass completeness needs to be evaluated locally for each 
cloud which accounts for the background confusion and the source 
distance. 

In order to derive the local mass completeness for each cloud 
we compared the mean mass of the embedded clumps with the cor¬ 
responding mean background-equivalent source mass Mi, g of each 
cloud. For each clump we define the background-equivalent source 
as a point source with a flux equal to the clump background flux in¬ 
tegrated in a circular region with a radius equal to the FWHIVLsoy,,,, 
at all wavelengths (see Section [T2t . The clump background flux 
per pixel at each wavelength is defined as the average flux emis¬ 
sion per pixel evaluated in a region surrounding each clump, with 
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Temperature (K) 

Figure 11. Temperature distribution of the 1723 clumps, starless (red) and 
protostellar (blue). The median values are T=15.49 K and T=17.12 K re¬ 
spectively, suggesting that the cold dust envelopes are slightly warmed by 
the central core(s) in the protostellar clumps. 



Log(M/M 0 ) 

Figure 12. Mass distribution of the 1723 clumps, starless (blue) and proto¬ 
stellar (red). The dashed red line is the mass completeness, fixed at M=105 
M 0 . The mass range spans several order of magnitudes, from few solar 
masses up to few 10 4 M s . The median values are M=194 M 0 and M=269 
M 0 for starless and protostellar clumps respectively. 

the clump masked and with a sigma-clipping procedure to avoid 
significant contamination from companion sources. 

The background-equivalent source mass is then evaluated us¬ 
ing the same SED fitting procedure described in the previous Sec¬ 
tion and assuming for the background sources the same distance as 
their corresponding IRDC. The mean value of all the background- 
equivalent source masses in each cloud determines the local IRDC 
Mj s . 

The mean mass of the background-equivalent sources com¬ 
pared with the sources in each IRDC is shown in Figure[l3] Sources 
with a mass up to M fcg are hidden in the background emission 
and unidentifiable in that specific cloud. M fcg sets a local mass 
completeness limit which accounts for the local background varia¬ 
tion and the IRDC distance. There are a few tens of clouds with 
M bg > 10 2 M 0 and, at the same time, several clouds for which 
sources with M< 10 M 0 are easily identifiable. On average the 


mean source mass is few times greater than the corresponding mean 
Mj s , however there are identified sources with masses close to the 
corresponding mean M fcg . 

To set an estimate of the mass sensitivity limit for the whole 
catalogue, we identify the mass value M com where 90% of clouds 
have Mj g <M com (the green dashed line in Figure 1131 . In other 
words, in 90% of the clouds all sources with masses M>M com are 
detected. This mass value is M com = 105 M 0 . We define this value as 
a mass completeness limit for the catalogues. The remaining 10% 
of the clouds with M/, g above the limit are among the furthest in the 
catalogue, with a mean distance of 5.4 kpc (in comparison with the 
average distance of the whole catalogue, 4.2 kpc, see Section [3Tl l. 

We stress that this mass completeness limit is strongly in¬ 
fluenced by the IRDC distance and has to be taken “cum grano 
salis”, in general a local mass completeness should be used for 
each cloud. As showed in Figure [T4] the distribution of M& g sub¬ 
stantially changes if we divide the source sample in two subsets, 
one including all the sources located at d < 4.2 kpc and one in¬ 
cluding all the sources located at d > 4.2 kpc. M com (d < 4.2)=37 
M 0 and M com (d > 4.2)= 167 M e for the close and the far sample 
respectively. To further demonstrate that the mass completeness is 
dominated by the diffuse emission and depends on the source dis¬ 
tance, we estimated the mass of a background-equivalent source lo¬ 
cated at the mean distance of d=4.2 kpc and with an emission at 160 
yum arising from residual noise only (0.6 Jy/pixel, iTraficante et al.l 
|201 lb , Its mass is = 10 M e (the orange-dotted line in Figure [T-i]). 
For comparison, the average M 6g of all clouds located at =4.2 kpc 
is =30 M Q , while = 10 M 0 is the mean M; Jg of clouds located at 
= 2.2 kpc. 

Finally, we note that the selection criteria of our sample of 
clouds (see Section [2}, together with the local mass completeness 
discussed in this Section, do not produce a complete sample of ob¬ 
jects. M com gives a reference value which only defines that we re¬ 
cover sources with masses ^ M f „„, in 90% of the clouds and it is 
not related to the underlying intrinsic clump mass spectrum, for 
which a statistically complete sample would be required. As a con¬ 
sequence, for example, it is not clear whether the turnover point in 
the source mass distribution, around M= 10 3 M 0 (see Figure IT2l). is 
meaningful beyond describing the actual distribution of clumps in 
the catalogue. The local mass completeness values for each cloud 
is available with the source catalogues. 

4.2 Mass vs. radius 

In the past it has been suggested that IRDCs are fav ourable 
places for high-mass star formation (e.g. ICarev et aklll998h . This 
suggestion has been corroborated by several observations of 
massive YSOs associated with IRDC s (e.g. iBeuther et al]|2013l : 
iBeltran et alJl2013l : IPeretto et al]|2013l) . However, IRDCs are also 
forming low-to-intermediate stars in regions devoid of high-mass 
stars (e.g. ISakai et all2013l) . 

One way to determine if the clumps associated with the IRDCs 
are ongoing high-mass star formation is to look at the mass-radius 
distribution under the assumption that, in order to form high-mass 
stars, a large amount of material needs to be concentrated in to 
a relatively small volume. Recently an empirical threshold for 
the formation of mass stars in IRDCs has been proposed. This 
threshold is, in its original formulation, m(r) > 870 M 0 (r/pc ) 1 ' 33 
jKaull'mann & Pilial20ld hereafter KP). This model accounts for 
single fragments within each star-forming region and it is well 
suited to be compared with our clumps catalogue. In Figure GH 
we show the mass-radius relationship for our catalogue of starless 
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IRDC mean background mass (Log(M/M 0 )) 


Figure 13. Distribution of the mean source mass per IRDC as function of 
the M^g. The blue line is the y=x line. The majority of the clumps have 
M bg- 



Log(M/M 0 ) 

Figure 14. M/, g distribution for sources located at d < 4.2 kpc (purple 
histrogram) and at d ^ 4.2 kpc (green histrogram). M com , which defines 
the mass value above which all sources are identified in 90% of the clouds, 
changes significantly with distance with values of M co ,„=37 M© for the 
distribution of the closest sources and M co/ „=167 for the distribution of 
the more distant sources. The orange-dotted line shows for comparison the 
mass of a background-equivalent source located at the mean distance of 
d=4.2 kpc and with an emission at 160 /im arising from residual noise only. 

clumps, colour-coded for their dust envelope temperature. The light 
brown shaded area delimits the KP region above which massive 
stars can be formed. For comparison, the figure also shows Larson’s 
third law as it appeared in its original formulation, m(r) > 460 M© 
(r/ pc ) L9 (green dashed line), which describes the universality of 
t he scaling relation between mass and radius of molecular clouds 
dLarso rli 198 ll >. 

We identify 171 starless clumps above the KP threshold, dis¬ 
tributed in 130 IRDCs. Interestingly by this criterion :=26% of the 
starless clumps are likely to form high-mass stars. These are situ¬ 
ated in —33% of the IRDCs with identified starless clumps (389, 
Section [3Tl i. At the same time, the clouds which encompass these 
high-mass starless clumps are only 4% of the total clouds in our 
sample (3493, see Section [2j. While the mass completeness limit 


influences the percentage of high-mass starless clumps, the num¬ 
ber of clouds which embed these high-mass clumps is much less 
affected since it is unlikely that we are missing the most massive 
objects associated with the clouds. We can conclude that only a 
small fraction of IRDCs wil l potentially form high-ma ss stars, in 
agreement with the results of Kauffmann & Pillaij 1201011 . 

The dash-dotted red line in Figure 031 define s the e mpirical 
threshold for high-mass star formation proposed bv lUrauhart et al .1 
i2014ll . This threshold is based on the analysis of high-mass star 
forming clumps identified in the ATLASGAL survey and corre¬ 
sponds to clumps with a constant mass surface density of E = 0.05 
g cm -2 . This threshold however sets less stringent constraints than 
the KP threshold for clumps with an equivalent radius of R< 1 pc, 
which are the majority of the clumps in our survey. We discuss in 
the next Section the mass surface density thresholds for high-mass 
star formation compared with our sample. 

The mass-radius diagram for protostellar clumps is shown in 
Figure |T6] A direct comparison of the protostellar clump sample 
with the KP threshold is less meaningful however, since the mass 
estimation of protostellar clumps can be affected by mass-loss in 
jets or outflows, or mass segregated in the central core(s) as well 
as overestimating the mass as a result of using a single temperature 
model and incompletely accounting for the heating by the central 
protostars. However, we find 456 protostellar clumps (=43% of the 
total) associated with 291 IRDCs (==50% of the IRDCs with em¬ 
bedded protostars, and = 8 % of the total) above the KP threshold 
and so are capable of forming high mass stars. 

The low percentage of IRDCs possibly forming high-mass 
stars does not exclude however that the majority of the high-mass 
star formation activity is associated with IRDCs. This issue can 
be addressed, for example, by comparing the star formation activ¬ 
ity within the IRDCs and in a large sample of massive molecular 
clouds not associated with IRDCs, or tracing the star formation rate 
inside and outside IRDCs. Both these analyses are the main topic 
of a forthcoming work. 

The temperature distribution in the diagrams show that the 
colder clumps are the bigger and the more massive. For starless 
clumps this may be an indication of their evolutionary phase, with 
the smaller clumps having already started central accretion and 
likely being just at the verge of a protostar core formation, or maybe 
with a very young protostar already formed but still obscured at 
70 yum. For protostellar clumps this is likely a consequence of 
the adopted model, since the single clump-averaged temperature is 
dominated by the extended cold dust envelope emission with less 
contribution from the warmer central regions as the clump size in¬ 
creases. 

4.3 Mass and Mass surface density 

The environment of massive star formation can be also investigated 
through the mass surface de n sity, E , versus mass diagram. The 
work of iKrumholz & McKee) (t2008h suggested E = 1 g cm -2 as 
a mass surface density threshold required to prevent fragmentation 
into low-mass cores as a result of radiative feedback, thus allowing 
high-mass star formation. However, this threshold has a large scat¬ 
ter and the calculation does not include the contribution of magnetic 
fields, which can prevent fragme ntation without imposing a mini¬ 
mum mass surface density (e.g. lButler & Tanil2012h . Several high- 
mass clumps and cores are indeed observe d with 0.1 < E < 0.5 g 
cm -2 dButler & Tanll20 1 1 : iTan et al]|2013l) . One of the most mas¬ 
sive cores observed in the inner Galaxy, associated with the IRDC 
SDC335, with a mass of = 500 M 0 in = 0.05 pc (Peretto et alj 
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Figure 15. The mass of the starless clumps as function of their equiva¬ 
lent radius, colour-coded for the temperature of the dust envelope. The 
mass increases with increasing equivalent radius. The unshaded area de¬ 
limits the region of the high-mas s star formation in IRDCs as determined 
by the lKauffmann & PillaJ fcOld) empiric analysis. Almost one third of the 
starless clumps (171 out of 667) lie a bove this thresh old. The line-dotted 
red line is the threshold adapted from lUrauhart et ahl 120141) who propose 
an empirical lower limit value for high-mass star formation equivalent to a 
const ant mas s surfa ce density of X = 0.05 g cm -2 . The green dashed line 
is the L arso 5 fl98l universal scaling relation between mass and radius of 
molecular clouds, shown for comparison. 
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Figure 16. The mass of the protostellar clumps as function of their 
equivalent radius, colour-coded for the temperature of the dust envelope. 
There are ^43% of the protostellar clumps (456 out of 1056) above the 
iKauffmann & Pillail 1201 Ol) threshold. 

120131) reaches £ ^ 60 g cm 2 . However the mass surface density 
of the embedding clump, with a mass of = 2600 M Q in = 1.2 pc 
( Peretto et al.1120131) , is E = 0.5 g cm ' 2 . The dark cloud, with a 
mass of = 5500 M Q within = 2.4 pc has a mean surface density 
of E = 0.25 g cm ~ 2 . On average, high-mass star forming regions 
associated with IRDCs are usually observed within 0.1 E < 1 
g cm ' 2 ban et al.ll2014l ). As discussed in the previous Section, a 
less stringent empirical threshold for high-m ass s tar formation in 
clumps has been suggested bv lUrquhart et all d2014D based on AT- 
LASGAL clump studies and corresponds to E > 0.05 g cm' 2 . 

In Figures[T7]and[T8]we plot the mass surface density-mass di¬ 
agrams for the starless and protostellar clumps respectively, colour- 


coded for the temperature of the dust envelope. The light-brown 
shaded area delimits the E > 0.1 g cm ' 2 region. There are 144 
starless and 308 protostellar clumps, respectively 22% and 29% of 
the two samples, within this region where massive star formation 
is possible. Adopting the lUrquhart et all 120141) threshold limit in¬ 
creases these numbers to 241 starless and 542 protostellar clumps 
(=36% and =51% of the respective samples). Following lTan et ahl 
d2014l) . we also show straight lines corresponding to constant ra¬ 
dius, constant escape speed and constant hydrogen number density. 
The apparent radius threshold at R= 1 pc is due to our choice of the 
maximum aperture size of clumps and the distance distributions of 
clouds, as discussed in Section l4~Tl 

The escape speed is v esc = (10/a v ,>) 1/2 cr, with a vlr being 
the virial parameter and a the velocity dispersion of the clump. 
Assuming a virial parameter a vjr = 1 for gravitationally bounds 
clumps and an average velocity dispersion of cr = 1 km/s (e.g. 
Hens haw et al.ll20l3l jKauffman n~etaDl2013h the escape speed is 
v esc = 3 km/s. Therefore, for a typical high-mass star form¬ 
ing clump with mas s of M= 1000 M G dTackenberg et al.l 120121 : 
lUrouhart et alj|2()14r) . the clump is bound if its mass surface den¬ 
sity reaches E = 0.1 g cm' 2 . Low values of viral parameters have 
been observed in reg ion of high-mass star formation fe.g. lTan et al.l 
120131 : IlI et alj|2013h . Assuming a V i r = 0.5 the corresponding es¬ 
cape speed becomes v esc = 4.5 km/s and a M= 1000 M 0 clump 
will be bound only if reaches E = 0.3 g cm ' 2 or, equivalently, if its 
mass is confined in a radius R= 0.45 pc. We find 38 starless and 76 
protostellar clumps with E ^ 0.3 g cm' 2 , = 6 % and = 7% of the 
two samples respectively. We find only 4 starless and 4 protostellar 
clumps with E > 1 g cm' 2 . 

The hydrogen number density n H is analogous to constant 
free-fall time, following the relation t/f = [37r/(32Gp)] 1/2 , where p 
is the volume density of the clump. From the diagram we note that, 
setting the high-mass star formation threshold at E 3* 0.3 g cm' 2 , 
the minimum hydrogen number density of high-mass star-forming 
clumps is n H = 10 5 cm' 3 . This value corresponds to a free-fall time 
of 1.4 x 10 5 year. Assuming as a lower limit for the mass surface 
density E > 0.1 g cm ' 2 to form high-mass stars, from the diagram 
we deduce a minimum hydrogen number density of n H = 1.5 x 10 4 
cm' 3 , or a free-fall time of 3.6 x 10 5 year. With the surface density 
proposed by dUrouhart et al.ll2ol4l) . from our clumps distribution 
we deduced a minimum hydrogen number density of n H = 8 x 10 3 
cm' 3 , or a free-fall time of 5.0 x 10 5 year. 


For comparison we highlighted the regions in the plots oc¬ 
cupied by c ores/clumps a ssociated with a sample of 10 IRDCs 
selected by iButler & Tanl d2012t). the 5 1 massive star forming 


clumps identified by 

Mueller et al. (2002 

) and the 13 CO molecular 

clouds identified in 

Rnman-Duval et al. ( 

201(J). The SDC335 cen- 


tral clump and cloud values are also showed in the plot with dark 
grey asterisks. 


4.4 Luminosity 

Figure [19] shows the FIR luminosity distributions of the clumps. 
The median values of the luminosities are L=83 L 0 and L=590 L e 
for starless and protostellar clumps respectively. This difference in 
the FIR luminosity is principally due to the emission arising from 
the warm core(s) in the protostars. In the starless clumps the lumi¬ 
nosity of the dust envelope is the only contribution to the bolomet- 
ric luminosity. On the other hand, the warm core(s) in the proto¬ 
stellar clumps contributes substantially to the integrated FIR lumi¬ 
nosity. Depending on the evolutionary state of a protostar there can 
also be significant emission at 24 /mi or shorter wavelengths. The 
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Figure 17. Mass surface density as function of mass for starless clumps, colour coded for the temperature of the dust envelope. The light brown shaded area 
defines the region of sources with surface density S > 0.1 g cm -2 . The lines correspond to constant values of various parameters: red lines : constant radius; 
blue lines : constant H nu mber density, analogous of constant free-fall time; green lines : constant escape velocity. Also shown are t he r egion s corre sponding to 
IRDC cores observed bv lButler & Tanl J2012L blue shaded a rea), th e massive sta r form ation clumps (and cores) observed bv lMueller et alJ d2QQ2l red shaded 
area) and the 13 CO molecular clo uds from the catalo gue of Roman- Duval et al] J201ol green shaded area). The dark grey asterisks show the location of the 
SDC335 central clump and cloud IPeretto et alJboiT l. 


estimated FIR luminosity is therefore a lower limit of the protostars 
bolometric luminosity (e.g. lGiannini et al . 20121). 

We identified 71 protostellar clumps {=±1% of the total) with 
h> 10 4 L q . There are 52 starless clumps with 10 3 L 0 , relatively 
high for starless clumps. The majority of them (30 out of 59) have 
a bolometric luminosity vs. envelope mass {hi,„i / M, m .)< 1, in¬ 
dicative of objects without warm cores. These are among the most 
massive starless clumps identified in this catalogue. The remaining 
22 starless clumps with luminosity h> 10 3 L 0 and L toi / M em , > 1 
are either very young protostellar clumps or not confirmed starless, 
as from visual inspections some of them show them to be are as¬ 
sociated with particularly faint 70/rm sources not identified with 
our selection threshold (see Section [3~Tb or associated with bright 
diffuse 70 /tm emission. 


4.5 Mass-Luminosity 

A potentially useful tool to infer the evolutionary properties of the 
young clumps is the L toi - M e „ v diagram. This diagram has been 
successfully used in the p ast to describe the evolutionary phases 
of low-mass objects I lSaraceno et al.lll996h . and subsequently ex¬ 
tended to predict th e evolutionary paths of high-mass objects 
dMolinari et al.l[2008h . A source evolves following specific tracks, 
depending on its initial mass and luminosity, and the model pre¬ 
dicts the mass and luminosity of the object when it reaches the zero 


age main sequence (ZAMS). F or high-mass objects the paths fol¬ 
low the two-phases model of IMcKee & Tanl <20030 . According to 
this model, when a cloud starts its gravitational collapse the mass 
slightly decreases due the accretion and molecular outflows, while 
the luminosity of the object increases significantly, sustained by the 
collapse. The source moves along almost vertical paths in the dia¬ 
gram. At the end of this phase the object is surrounded by an HII 
region, the luminosity remains constant while the mass is expelled 
through radiation and molecular outflows. The object then follows 
an almost horizontal path, which will ends when it becomes visible 
in the optical and almost all the envelope has been expelled. Ac¬ 
cordingly to the evolutionary phase in the diagram, the high-mass 
objects are classified in analogy with the low-mass regime classifi¬ 
cation, from Class 0 to Class II prior to the ZAMS phase. This clas¬ 
sification can be misleading though, and some caveats of this model 
have to be kept in mind, in particular when applied to our sample: 
1) the transition from Class 0 to Class I and Class II sources in the 
low-mass regime is not sharp. Sources classified as Class 0 based 
on the ir NIR-MIR fluxes can instead be clas sified as Class I in the 
FIR (Hennemann et al . 20ld : lElia et al.l[2013h : 2) the evolutionary 
tracks have been initially modelled for single cores, not for clumps 
and 3) in high-mass regions the observed clumps can contain mul¬ 
tiple cores in different stages of evolution and the clump luminosity 
is likely dominated by the most evolved core(s). Nevertheless, the 
L toi - M e „ v diagram has been used in the past to discuss the evo¬ 
lutionary tracks of YSOs with Herschel data in a wide range of 
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Figure 18. Mass surface density as function of mass for protostellar clumps, colour coded for the temperature of the dust envelope. The shaded areas and lines 
are as shown in Figure [171 
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Figure 19. FIR luminosity distributions for the 1723 clumps, evaluated in 
the range 70 < A < 500 // m as detailed in the text. The median values are 
L=83 L 0 and L=590 L 0 for starless and protostellar clumps respectively. 
This difference is a direct consequence of the internal luminosity of the 
core(s) embedded in the protostellar clumps which are partly visible in the 
70 jum emission. 


ISaraceno et all l l 1 9961) and lMolinari et all 12008!) . which shows the 
tracks for stars with final masses of 6.5, 8, 13.5, 18 and 35 Mg from 
left to right respectively The best l og-lo g fit for Class I and Class 
0 high-mass objects dMolinari et al.ll2008 ) are also shown. The ma¬ 
jority of the sources lie below the best log-log fit of Class 0 ob¬ 
jects. Also, the distribution lies on average below the distribution of 
young high-mass clumps identified in lUrquhart et all 120141) . which 
includes clumps at different stages of evolution but does not include 
starless ones. We find a m ean L^,; / M wn , - 1.1, compared with a 
mean L bot - M e „ v ^ 10 of llJrauhart et alj l20l4) . consistent with 
the majority of these starless clumps being in a very early stage of 
their evolution. Figure |2T| shows the L- M„ n , diagram for the 
protostellar clumps. The distribution overlaps that of the starless 
clumps, proving little discrimination between the relative evolu¬ 
tionary stages of the sources. However the mean L bo i - M,,„„ = 6.6 
is an indication of a more evolved phase and including measure¬ 
ments at shorter wavelengths will increase the luminosity of many 
sources (Section[4]l and raise the distribution in the diagram. In ad¬ 
dition, higher angular resolution measurements which can resolve 
the actual envelope mass (as opposed to the clump mass) associ¬ 
ated with each protostar, will move the points to the left. Without 
these additional data, the high degree of overlap in the distributions 
limits the value of these plots for probing the evolutionary status of 
the sources. 


different conditions, bot h for low-mass (e.g. lBontemps et ai1l20ld : 
iHenne mann et al. 201( 3) and for intermedia te-to-high mass regime 
fe.g. lElia et all2()7(il : IVeneziani et alJl2013h . 

The L bo , - M,,,,,, diagram for the starless clumps is shown 
in Figure [20] The distribution is superimposed to the models of 


4.6 Clumps lifetime 

Comparing the number of starless to protostellar clumps provides 
an estimate of the lifetime of the starless phase. This is of particular 
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Figure 20. L- M f „ v diagram for s tarless clumps superimposed with 
the model of Saraceno et al. 1 1996) and Molinari et al. 1 2008, green lines). 
The solid and dashed lines are the best log-log fit for Class I and C lass 0 
sources respectively, extrapolated in the high-mass regime bv lMolinari et alj 
idOOSh - The red vertical line is our mass completeness limit, M=105 M e . 
The sources are sparse in the diagram but the majority are below the Class 
0 fit, showing that they are in a very early stage of evolution. 



Figure 21. The same diagram of Figure l20l but for protostellar clumps. 


interest for those regions most likely to form high mass stars, the 
most massive clumps with high surface densities. For clumps with 
a mass M> 500 M 0 and a surface density X > 0.1 g cur 2 the ra¬ 
tio of the number of starless clumps to protostellar clumps is 0.46, 
which increases only slightly to 0.48 when considering clumps with 
a surface density X > 0.3 g cnr 2 . This suggests a lifetime for the 
starless phase of about half that of the protostellar phase. Adopt¬ 
ing an estimate of fewxl0 5 yr for the lifet ime of the embedded 
phase of young massive stars (e.g. lDavies et al J[201 ll) . the starless 
phase duration would be ~ 10 5 yr, in agreement with the lifetime 
estimation of starless c lumps identified in the ATLASGAL survey 
dTackenberg et alj|2012h . Since some starless clumps may contain 
currently unidentified protostars as suggested by the high luminos¬ 
ity to mass ratio for some of the apparently starless clumps (Section 
14.4b . this lifetime represents an upper limit. 


5 CONCLUSION 

We have produced a catalogue of starless and protostellar clumps 
identified in a sampl e of 3493 IRDCs extracted from the 
IPeretto & FulleJ (2009 1 catalogue in the region of the Galactic 
Plane delimited by 15° < / < 55°, \b\ < 1°. Using the Herschel 
observations acquired as part of the Hi-GAL survey, we first iden¬ 
tified the FIR counterparts of the IRDCs in the Hi-GAL data at 70, 
160, 250 and 350 pn i. The compact sources were extracted from the 
Hi-GAL IRDCs counterparts using Hyper , a new algorithm which 
combines the advantages of the aperture photometry with a min¬ 
imal use of source modelling to take into acco unt the high vari - 
able background and the source crowding jTraficante et al.ll2015h . 
The sources were initially identified at 160 pm and we found 5393 
160 pm sources distributed in 1640 IRDCs (=47% of the initial 
sample). All the sources with a counterpart at 250 and 350 pm, 
but no counterpart(s) at 70 pm are identified as starless. If at least 
one 70 pm counterpart is present, they are identified as protostellar 
clumps. We identified 1056 protostellar in 586 IRDCs (=47%) and 
667 starless clumps in 389 IRDCs (=11%). There are 211 IRDCs 
which contain both starless and protostellar clumps. 

Assuming for each source the distance of its parent cloud we 
evaluated the equivalent radius, temperature, mass and luminosity 
of the sources dust envelope, modelling the Hi-GAL fluxes in the 
range 160-350 pm with a single-temperature greybody model and 
a fixed spectral index of /J = 2.0. The adopted model is a good 
approximation to describe the cold envelopes with only small tem¬ 
perature gradients across the clumps, but it does not account for the 
emission arising from the central, warmer regions of protostars. As 
a consequence it underestimates the average dust temperature (and 
therefore overestimates the mass) of the protosteallr clumps. The 
mean equivalent radius is = 0.6 pc for both the starless and pro¬ 
tostellar objects, representative of clumps. This is probably driven 
by our limited capacity of resolving smaller clumps of cores. In¬ 
deed the mean distance of the clumps, 4.2 kpc, does not allows us 
to resolve objects smaller than = 0.4 pc on average. The distribu¬ 
tions of the source temperatures are different, as demonstrated by 
a KS test, however the mean temperature values differ for less than 
2 K, T=15.5 K and T=17.1 K for starless and protostellar clumps 
respectively. The mass distributions are also similar, with median 
values of 193 M 0 and 272 M 0 respectively, and extending up to 
= 10 4 and = 2 • 10 4 M 0 for the more massive starless and pro¬ 
tostellar clumps respectively. The similarities in temperature and 
mass indicate that the dust envelope of the protostellar clumps has 
been not yet warmed-up by the inner core(s) and it is relatively 
unaffected by the gravitational collapse occurring in the core(s). 
Both these conclusions indicates that the population of protostellar 
clumps observed in this sample of IRDCs is very young. 

We demonstrated that, due to the heterogeneity of the cloud 
structures, the mass completeness has to be assumed locally for 
each cloud. We have however set a reference mass completeness of 
M=105 M 0 for the whole sample, which corresponds to the mass 
above which clumps can be detected in 90% of the clouds. 

The mass vs. radius distribution of the starless clump is used 
to i dentify the clumps which m ay lead to high-mass stars, assuming 
the lKauffmann & Pi 11 ail (1201 ()|) empiric threshold for high-mass star 
formation. We show that almost one third of the starless clumps 
(171 out of 667), distributed in 130 IRDCs (only 4% of the whole 
sample), are above the high-mass stars threshold. In other words 
only a small fraction of the clouds are likely to form massive stars. 

The environment of massive star formation has been further 
investigated through the mass vs. surface density diagram. Assum- 
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ing th at high-mass star formation occurs i n regions with E ^ 0.1 g 
cm -2 iButler & TarJl20lj|Tan et~ai]|2014l) we identify 144 starless 
and 308 protostellar clumps that will likely form high-mass stars. 
From the mass surface density diagram of the starless clumps we 
identify the maximum free-fall time of our sample of t= 4.4 x 10 5 
year. Fixing th e mass su rface density threshold to S > 0.05 g cnr 2 , 
as proposed bv lUrquhart et all 12014) . this time rises to t=; 5.Ox 10 3 
year. The numbers of massive (M> 500 M Q ), high surface density 
starless clumps and those already containing embedded sources in¬ 
dicate an upper limit lifetime for the starless massive clump phase 
of ~ 10 5 years, similar to the free-fall time. 

The FIR luminosity distributions dilfers with mean values of 
L=83 L g and L=590 L e for starless and protostellar clumps re¬ 
spectively, principally because the protostellar clumps luminosity 
is highly influenced by the luminosity of the embedded core(s). A 
total of 71 protostellar clumps, = 7% of the sample, have FIR lumi¬ 
nosity > 10 4 L 0 . The FIR luminosity is a good approximation for 
the bolometric luminosity only for starless clumps, and we com¬ 
pared the starless L hol - M env diagram with the evolutionary mod- 
els for lo w- and high-mass obje cts developed by ISaraceno et al] 
dl996i) and lMolinari et al] d2008h . The starless clumps distribution 
is sparse but most of the sources are below the line which de¬ 
scribes the high-mass counterparts of low-mass Class 0 regime. 
Although a clear analogy with the low-mass classes classification 
is not straightforward in the high-mass regime, the mean h bo i / 
M env - 1.1 of the distribution shows that the starless clumps are 
in the very early stages of their evolution. 

These catalogues provide a significant sample to identify in¬ 
teresting sources to study in detail the mechanisms involved in the 
early stages of star formation. Future work will extend the cata¬ 
logue to the whole sample of IRDCs in the Galactic Plane and will 
constraint the dust properties of these clumps, identifying counter¬ 
parts at shorter wavelengths, as well as in the high-resolution sub- 
mm/mm surveys of the Galactic Plane such as ATLASGAL and the 
BGPS. 

The catalogues are freely available at the webpage 
http://www.irdarkclouds.org. 
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